Manuscripts Character Recognition Using Machine Learning and Deep Learning

نویسندگان

چکیده

The automatic character recognition of historic documents gained more attention from scholars recently, due to the big improvements in computer vision, image processing, and digitization. While Neural Networks, current state-of-the-art models used for recognition, are very performant, they typically suffer using large amounts training data. In our study we manually built own relatively small dataset 404 characters by cropping letter images a popular manuscript, Electronic Beowulf. To compensate use ImageDataGenerator, Python library was augment Beowulf manuscript’s dataset. augmented once, twice, thrice, which call resampling 1, 2, 3, respectively. classify efficiently, developed customized Convolutional Network (CNN) model. We conducted comparative analysis results achieved proposed model with other machine learning (ML) such as support vector (SVM), K-nearest neighbor (KNN), decision tree (DT), random forest (RF), XGBoost. pretrained VGG16, MobileNet, ResNet50 extract features images. then trained tested above ML recorded results. Moreover, validated CNN against well-established MNIST Our achieves good accuracies 88.67%, 90.91%, 98.86% cases respectively, Additionally, benchmark accuracy 99.03%

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

the relationship between using language learning strategies, learners’ optimism, educational status, duration of learning and demotivation

with the growth of more humanistic approaches towards teaching foreign languages, more emphasis has been put on learners’ feelings, emotions and individual differences. one of the issues in teaching and learning english as a foreign language is demotivation. the purpose of this study was to investigate the relationship between the components of language learning strategies, optimism, duration o...

15 صفحه اول

Facial Expression Recognition in Older Adults using Deep Machine Learning

Facial Expression Recognition is still one of the challenging fields in pattern recognition and machine learning science. Despite efforts made in developing various methods for this topic, existing approaches lack generalizability and almost all studies focus on more traditional hand-crafted features extraction to characterize facial expressions. Moreover, effective classifiers to model the spa...

متن کامل

Painfree LaTeX with Optical Character Recognition and Machine Learning

Recent years have seen an increasing interest in harnessing advancements machine learning (ML) and optical character recognition (OCR) to convert physical and handwritten documents into digital versions. The increasing adoption of digital documents in academia, however, has provided a new layer of complexity to automatic digitization of physical documents. Compared to typical texts written in n...

متن کامل

Deep Character-level Neural Machine Translation by Learning Morphology

Neural machine translation aims at building a single large neural network that can be trained to maximize translation performance. The encoder-decoder architecture with an attention mechanism achieves a translation performance comparable to the existing state-of-the-art phrase-based systems. However, the use of large vocabulary becomes the bottleneck in both training and improving the performan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Modelling

سال: 2023

ISSN: ['2673-3951']

DOI: https://doi.org/10.3390/modelling4020010